Design and Implementation of A Web Mining Research Support System

Authors

  • Jin Xu
  • Gregory Madey
  • Patrick Flynn
Abstract

The evolution of the World Wide Web has brought us enormous and ever-growing amounts of data and information. With the abundant data it provides, the web has become an important resource for research. Designing and implementing a web mining research support system has become a challenge for people interested in using information from the web in their research, since traditional data extraction and mining techniques cannot be applied directly to the web because of its semi-structured or even unstructured nature. This proposal describes the design and planned implementation of a web mining research support system. The system is designed for identifying, extracting, filtering and analyzing data from web resources, and is composed of several stages: Information Retrieval (IR), Information Extraction (IE), Generalization, and Analysis & Validation. Its goal is to provide a general solution that researchers can follow to utilize web resources in their research. Methods such as Natural Language Processing (NLP) and Artificial Neural Networks (ANN) will be applied to design new algorithms, and data mining technologies such as clustering and association rules will also be explored in designing and implementing the system. IR will identify web sources by predefined categories with automatic classification; IE will use a hybrid extraction method to select portions of a web page and put the data into databases; Generalization
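The staged design named in the abstract (IR, IE, Generalization, Analysis & Validation) can be pictured as a chain of functions. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the `Page` type, all function names, the keyword matching used in place of automatic classification, the `key: value` line parsing used in place of hybrid extraction, and the per-category counting used as a stand-in for the Generalization stage (whose description is cut off in this record) are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Page:
    """Hypothetical container for one web page moving through the stages."""
    url: str
    text: str
    category: str = ""
    record: Dict[str, str] = field(default_factory=dict)


def retrieve(pages: List[Page], keywords: Dict[str, List[str]]) -> List[Page]:
    """IR stage (assumed): keep pages that match a predefined category.

    Keyword matching stands in for the automatic classification
    mentioned in the abstract.
    """
    kept = []
    for page in pages:
        lowered = page.text.lower()
        for category, words in keywords.items():
            if any(word in lowered for word in words):
                page.category = category
                kept.append(page)
                break
    return kept


def extract(pages: List[Page]) -> List[Page]:
    """IE stage (assumed): copy 'key: value' lines into a record."""
    for page in pages:
        for line in page.text.splitlines():
            key, sep, value = line.partition(":")
            if sep:
                page.record[key.strip().lower()] = value.strip()
    return pages


def generalize(pages: List[Page]) -> Dict[str, int]:
    """Generalization stage (placeholder): count pages per category."""
    counts: Dict[str, int] = {}
    for page in pages:
        counts[page.category] = counts.get(page.category, 0) + 1
    return counts


def validate(summary: Dict[str, int], expected_total: int) -> bool:
    """Analysis & Validation stage (assumed): check the summary total."""
    return sum(summary.values()) == expected_total
```

A usage sketch: build a list of `Page` objects, pass it through `retrieve`, `extract` and `generalize` in that order, and call `validate` on the resulting summary. Keeping each stage as a separate function means any one of them can be replaced (for example, with an NLP- or ANN-based classifier) without changing the others.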

Similar resources

Designing a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms

Background and Aim: As web surfing has entered the lifestyle of the vast majority of people in society, and more accurate social and cultural policy making is needed in this field, the authors set out to analyze how users in society view different websites, so as to help politicians and practitioners. Methods: The design science research method is used in this research...

A Research Support System Framework for Web Data Mining

Design and implementation of a research support system for web data mining has become a challenge for researchers wishing to utilize useful information on the web. This paper proposes a framework for web data mining support systems. These systems are designed for identifying, extracting, filtering and analyzing data from web resources. They combine web retrieval and data mining techniques toge...

A comparative study of two meta-heuristic algorithms in optimizing cost of reinforced concrete segmental lining

In this work, we tried to automatically optimize the cost of the concrete segmental lining used as a support system in the case study of Mashhad Urban Railway Line 2 located in NE Iran. Two meta-heuristic optimization methods including particle swarm optimization (PSO) and imperialist competitive algorithm (ICA) were presented. The penalty function was used for unfeasible solutions, and the seg...

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

One of the best-known recommendation methods is user-based Collaborative Filtering (CF). This system compares the active user's item ratings with the historical rating records of other users to find similar users, and recommends items that seem interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

Design and implementation of an intelligent clinical decision support system for diagnosis and prediction of chronic kidney disease

Introduction: Chronic kidney disease (CKD) is one of the most important public health concerns worldwide. The steady increase in the number of people with end-stage renal disease (ESRD), who need a kidney transplant to survive and incur high costs, highlights the importance of early diagnosis and treatment of the disease. This study aimed to design a Clinical Decision Support System (CDSS) for diagnosing CKD and p...

Publication date: 2003